PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information
TF ID Vocar.0017s0086.2.p
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Chlorophyta; Chlorophyceae; Chlamydomonadales; Volvocaceae; Volvox
Family CPP
Protein Properties Length: 2239aa    MW: 221808 Da    PI: 7.0325
Description CPP family protein
Gene Model
Gene Model ID          Type    Source
Vocar.0017s0086.2.p    genome  JGI
Signature Domain
No.  Domain  Score  E-value  Start  End   HMM Start  HMM End
1    TCR     44.3   3.5e-14  1070   1109  2          42
                  TCR    2 ekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNkeek 42  
                           ++k+C+Ckks+Clk+YC+Cfaag++C++ C+C +C+N+ e+
  Vocar.0017s0086.2.p 1070 SSKSCRCKKSQCLKLYCDCFAAGQYCGS-CSCISCHNRPEH 1109
                           689*************************.********9875 PP

2    TCR     46.3   8.3e-15  1141   1179  1          39
                  TCR    1 kekkgCnCkkskClkkYCeCfaagkkCseeCkCedCkNk 39  
                           k+k+gCnC+ks+ClkkYCeC++ g+kC+ +C+C +C+N 
  Vocar.0017s0086.2.p 1141 KHKRGCNCRKSHCLKKYCECYQGGVKCGIQCTCMECENM 1179
                           589***********************************7 PP

Protein Features
3D Structure
Database         Entry ID  E-value  Start  End   InterPro ID  Description
SMART            SM01114   9.5E-14  1069   1109  IPR033467    Tesmin/TSO1-like CXC domain
PROSITE profile  PS51634   30.722   1070   1181  IPR005172    CRC domain
Pfam             PF03638   1.3E-11  1072   1106  IPR005172    CRC domain
SMART            SM01114   1.4E-13  1141   1182  IPR033467    Tesmin/TSO1-like CXC domain
Pfam             PF03638   4.7E-11  1143   1179  IPR005172    CRC domain
Gene Ontology
GO Term     GO Category         GO Description
GO:0044212  Molecular Function  transcription regulatory region DNA binding
Sequence
Protein Sequence    Length: 2239 aa
MRRSSPPREP GVGPEGAKAS ARPGLRTVPT MPNKRQAIQG YDVDNSLGSP LFPRDYDQLR  60
NLLGSPLPPL HSPPLFQPSP RRSILTSPAR PAANQRPSNQ IPSNNSHRHN PDSMDALASF  120
FAPSPALPVP LFSPSAAAPN SLFDTSVNRA RADVFTPTPC KSGALDHVTA LLHDVGRGGQ  180
SGAGSDPIGV HLQHQRCNND SAVPTNAATD PHHHEGGSGG SGSGSGSGGT AIPVNVASAS  240
ASAGISCTAQ LAAAALRSQA NKASSSHGFH MIHDGGFGFG FGRSQATPGL MLSLSGQGGF  300
GPIHHPVPGY PSDLLGISGP GSHMFGGSGG AAAGIDGGSG NGGGLYGSFC GIANGGGGGC  360
AGLQLNVKDM LLAHHDDDDR SGGLLTSMPS FCGGGGSGGG GGGFGSGLGL GRPRSRYLDF  420
CTTPRHSADA AASGGAAAAP DATSGDGARA GGSGSSGGVA SNVGAGGGSG SSVGIGEGVG  480
VVGSVEGGRC GGGGIYPGGL SSRHHGGGLL LAPQLPAPPL LQTETNTATT GSNTGQHSRG  540
ADGAAESPPE SLDEPQKPVL SYPSPNEETR IAIAGGCVKV VESGASRTTV ETPGVELGCG  600
PSGGGGGASG GITSTEVAAG PAAAATGTER MATVSASAAG GGGGGGSNGV LVPSSGGSFG  660
LRTESLLVLP GTSVAVPGGG GGGLMVPVSG SGRGGGGNDG GGCDLSSSME ADTMQYSAGG  720
RGSGAGGVCL DRDLGMSSPS LLPPPPLSLG GLMPSSFSVP PHLQPNHHHL QHHHHHHHHQ  780
QQQQQQHYQS SAMTSSMMDL SMARGGSGAG TGASTLMMVS EGGGAMPGLQ PLSQMQMPLC  840
EDDKSFVKRQ IQQQQMQQQQ MQQQQMQQQQ PLVRGGSGTF AVRMASGGSG AGGGGGGGVN  900
GSSGGLLQGR DAAAAVAASG GCERPGVGLP PRSGGGGGGG INVMPGGGMP GAAVAAASVG  960
GAGGGGAGNG GASTPSTLQR PQRTRTASSY GGGGAGGAMN EGTVMSMRGA PSMDFDVVVP  1020
ELELSPDFPG RGGINANANP NAHRSSGAGG GLTQIQGGGP NRGRRTSENS SKSCRCKKSQ  1080
CLKLYCDCFA AGQYCGSCSC ISCHNRPEHA DRVLQRREDI AARDPQAFTR KIQLAPNGNG  1140
KHKRGCNCRK SHCLKKYCEC YQGGVKCGIQ CTCMECENMD VGSSQEGAGA RGALKRGGAA  1200
AKGAGGRAGG GGGGGGSRAG SRRSSATGMY DDYAPSPPLP STSGCSDGPS PTPSQGTVPG  1260
SVMLQPPPPL ASMPSLTVAA AAAAAAAATT AATTNHFAMS LGSGGDGAAG CTASMPYGGG  1320
HALSAGVVQF SEDGTVRRNS TNSLSHSQAP PVAQPPQLLP PSQQQQQQLQ SQMQSMPAPL  1380
PPNFLRQQQQ QEVQLQPMHS HPHHQQQQQE QQESPSLICG EMLQQQQQQQ QQQQQQQQQQ  1440
QQQQQQQQQQ QQQQQQQQQY YQQQQMVKRS LPPELYGSGS GSDAVARDTC CRGDGGDGDG  1500
EILPGNLRDF QGVVRDEMDE DAEEEEEEEE GDGPSQEQLG PLKRRRKQEL GRRTAATPLP  1560
LPSDHPTAPT SESSALATGG TWAEEARNSA GCNNRRGAAA TVAAAHASDN NADNAYPRTE  1620
GGGMGDMTLA AVGTAGGDLG PSGAGAAAAM AAAAAMPPPP SGQDLRFSLG PEPPGFTPRG  1680
LGISSLDVVS PPPLSMLTHL ESDTDSDGGG LEGAGGGLGC RPRRRSAQHQ YRQQYMNTGG  1740
AAVAAAAGGG GGGGGGAADV PHPSALRRNG GSRHQSHGMV GLDVGVMDFD DSSSALADAM  1800
ITAIADEASR GPMAGGGECV ATTAGAAGTA AAAAAAAAAA GRHQHRQSGE AAAAAGSDAA  1860
TAREGGGVLL CGELLGSGCL LDDGSNDMFL AGFEPNSVEG ARGCGSFGFG SGGSGGGFLS  1920
PRFGGMGGGN SSFGLATSPT AFPRSGGVNG SLGLCAVSPQ WRVRPPGLGP MGEVAAAVGP  1980
PLGLMSSNSW LHLPHRRPSR FAPTRVNGGS GGGGSSAMSY DPSQLPPWPP VASVTTCTVG  2040
GPLVPELCGL DSGLALKGGH LDMGRAGAAA AAAASVHTLV SPVRTSAMSA AALARRRETG  2100
GPEGDASRAP SYQGAPSGCG DEQRESGPWV VPAAHHHLEA ASSPSKQQRC FMATAAGGGG  2160
NLAAPLQLPQ PSAMTPGEQP QFDILTGGSG GGHPRTQPPG GGRGGSRNGG GCAGAAGGGA  2220
SANRPPRAGS FVADNGAA*
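The Start and End coordinates in the Signature Domain table can be checked against the block-formatted sequence above. The following is a minimal sketch, assuming the coordinates are 1-based and inclusive (which is consistent with the alignment rows in this entry). The helper names `parse_blocks` and `slice_1based` are hypothetical, not part of any PlantTFDB tool, and the excerpt covers only residues 1021 to 1140 of the sequence.

```python
import re


def parse_blocks(text: str) -> str:
    """Join a block-formatted sequence into one string.

    Drops spaces, running residue counts, newlines, and a trailing '*'.
    """
    return re.sub(r"[^A-Za-z]", "", text).upper()


def slice_1based(seq: str, start: int, end: int, offset: int = 0) -> str:
    """Return residues start..end (1-based, inclusive).

    `offset` is the number of residues that precede seq[0] in the
    full-length protein (0 when `seq` is the complete sequence).
    """
    lo = start - offset - 1
    hi = end - offset
    if lo < 0 or hi > len(seq) or lo >= hi:
        raise ValueError("coordinates fall outside the supplied sequence")
    return seq[lo:hi]


# Residues 1021-1140, copied from the two sequence lines ending at 1080 and 1140.
excerpt = """
ELELSPDFPG RGGINANANP NAHRSSGAGG GLTQIQGGGP NRGRRTSENS SKSCRCKKSQ  1080
CLKLYCDCFA AGQYCGSCSC ISCHNRPEHA DRVLQRREDI AARDPQAFTR KIQLAPNGNG  1140
"""

segment = parse_blocks(excerpt)

# First TCR domain: Start 1070, End 1109 in the Signature Domain table.
domain1 = slice_1based(segment, 1070, 1109, offset=1020)
print(domain1)
```

The printed segment is SSKSCRCKKSQCLKLYCDCFAAGQYCGSCSCISCHNRPEH, which is the query row of the first TCR alignment with its single gap character removed.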
Nuclear Localization Signal
NLS
No.  Start  End   Sequence
1    1204   1212  GGRAGGGGG
Binding Motif
Motif ID  Method  Source
MP00624   PBM     Transfer from PK22848.1
Regulation -- PlantRegMap
Source       Upstream Regulator  Target Gene
PlantRegMap  -                   Retrieve
Annotation -- Protein
Source  Hit ID          E-value  Description
Refseq  XP_002953265.1  0.0      hypothetical protein VOLCADRAFT_94037
TrEMBL  D8U3R6          0.0      D8U3R6_VOLCA; Putative uncharacterized protein
Best hit in Arabidopsis thaliana
Hit ID       E-value  Description
AT3G22760.1  5e-29    Tesmin/TSO1-like CXC domain-containing protein